Nicer model hashes; `model_hash_to_model` method #86

dilpath · 2024-03-26T18:06:33Z

Model hashes are now {MODEL_SUBSPACE_ID}-{MODEL_SUBSPACE_INDICES}-{PETAB_HASH}.

This makes possible a petab_select_problem.model_hash_to_model method, that converts a model hash into a model.

For parameters with > 10 options, alphabet characters will be used (similar to hex, but now 62 characters from [0-9][A-Z][a-z]. For parameters with > 62 options, the actual index delimited by . will be used.
e.g.
[estimate, 0, estimate, 0, 0] (values) -> [1,0,1,0,0] (indices) -> model_subspace_A-10100-petab_hash (hash)
[1, 36, 0] (indices) -> model_subspace_A-1Z0-petab_hash (hash)
[1, 63, 0] (indices) -> model_subspace_A-1.63.0-petab_hash (hash)

Also possible to just go with the last option with .s for simpler implementation and broad applicability. However, I expect most users will have parameters that can only be one of (0, estimate}, in which case all indices will be one of {0, 1}, so the more compact first representation is probably preferred.

The PEtab hash is computed from only the following information:

absolute location of the PEtab problem YAML file in the filesystem
nominal values of parameters in the model's PEtab problem
estimated parameters in the model's PEtab problem

dweindl

Overall, very convenient, thx.

But wait, is it really model_subspace_A-1.63.0? I think it produces model_subspace_A.1-63-0.

I am not sufficiently familiar with the matter to tell whether there can be any collisions or not.

Some docstrings + return type annotations would be great.

petab_select/model.py

dweindl · 2024-03-26T20:31:49Z

Also possible to just go with the last option with .s for simpler implementation and broad applicability. However, I expect most users will have parameters that can only be one of (0, estimate}, in which case all indices will be one of {0, 1}, so the more compact first representation is probably preferred.

No strong opinion. Let's see it in action and decide then.

Co-authored-by: Daniel Weindl <[email protected]>

dilpath · 2024-03-26T22:30:48Z

But wait, is it really model_subspace_A-1.63.0? I think it produces model_subspace_A.1-63-0.

🙈 Thanks! I doubt this case in the current implementation will ever be used, I've never heard of anyone using more than maybe 3 or so options for a parameter. Not sure whether PEtab Select would even handle this case well.

I am not sufficiently familiar with the matter to tell whether there can be any collisions or not.

It should be that every model subspace ID is unique in the context of a single PEtab Select problem, which also means that every model subspace ID exists in only one row of one model space file, of possibly many model space files. So, a model_subspace_id with its model subspace indices should correspond to one model in the full model space, at most.

It could be that multiple model subspace ID + indices combinations correspond to the same "model" (same mathematical model structure and estimated parameters), due to e.g. model subspace rows in the model space file that match in everything except their unique model subspace ID. This would mean that the "same" model that exists in different subspaces could be calibrated multiple times (once per duplicate) during model selection. But, this would always be under a unique hash, so no collisions at least.

No strong opinion. Let's see it in action and decide then.

Sounds good!

dweindl · 2024-03-27T06:59:29Z

It should be that every model subspace ID is unique in the context of a single PEtab Select problem

Thanks for the explanation. I think it would be good to stress in the docs that those hashes are context-dependent (which is usually not the case for hashes). I.e., after modifying a given PEtab select problem, a certain hash may identify a different model. This shouldn't really be a practical limitation, but users need to be aware.

petab_select/model.py

Co-authored-by: Daniel Weindl <[email protected]>

make model hashes nicer; add model hash to model method

04f9034

dilpath requested a review from dweindl March 26, 2024 18:06

dilpath mentioned this pull request Mar 26, 2024

Select: problem-specific minimize method for SaCeSS ICB-DCM/pyPESTO#1339

Merged

Dilan Pathirana added 2 commits March 26, 2024 19:23

clean

1a6d881

black

43478a0

dweindl reviewed Mar 26, 2024

View reviewed changes

petab_select/model.py Outdated Show resolved Hide resolved

petab_select/model.py Outdated Show resolved Hide resolved

petab_select/model.py Outdated Show resolved Hide resolved

dilpath and others added 4 commits March 26, 2024 23:04

Apply suggestions from code review

bb120cc

Co-authored-by: Daniel Weindl <[email protected]>

review

36e72a2

fix delimiters

bfd6cb5

doc

bf1bf17

dweindl approved these changes Mar 27, 2024

View reviewed changes

petab_select/model.py Outdated Show resolved Hide resolved

dilpath and others added 6 commits March 27, 2024 10:38

Update petab_select/model.py

a63b8e7

Co-authored-by: Daniel Weindl <[email protected]>

doc hash uniqueness

714c86d

:class:ModelHash

36a1503

typo

b693ebf

typo

96f0022

support user-supplied models with no specified subspace

9ab8a23

dilpath merged commit 5bda113 into develop Oct 6, 2024
3 checks passed

dilpath deleted the model_hashes_nicer branch October 6, 2024 19:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Nicer model hashes; `model_hash_to_model` method #86

Nicer model hashes; `model_hash_to_model` method #86

dilpath commented Mar 26, 2024 •

edited

Loading

dweindl left a comment

dweindl commented Mar 26, 2024

dilpath commented Mar 26, 2024

dweindl commented Mar 27, 2024

Nicer model hashes; model_hash_to_model method #86

Nicer model hashes; model_hash_to_model method #86

Conversation

dilpath commented Mar 26, 2024 • edited Loading

dweindl left a comment

Choose a reason for hiding this comment

dweindl commented Mar 26, 2024

dilpath commented Mar 26, 2024

dweindl commented Mar 27, 2024

Nicer model hashes; `model_hash_to_model` method #86

Nicer model hashes; `model_hash_to_model` method #86

dilpath commented Mar 26, 2024 •

edited

Loading